Experiments with Cholesky Factorization on Clusters of SMPs

نویسندگان

  • Siegfried Benkner
  • Dieter F. Kvasnicka
  • Maria Lucka
چکیده

Cholesky factorization of large dense matrices is an integral part of many applications in science and engineering. In this paper we report on experiments with different parallel versions of Cholesky factorization on modern high-performance computing architectures. For the parallelization of Cholesky factorization we utilized various standard linear algebra software packages and present performance results on SMP clusters and shared-memory cc-NUMA machines. Clusters of SMPs can be characterized as hybrid parallel architectures which combine the main architectural features of distributed-memory and shared-memory parallel computers. Although the availability of SMP clusters is increasing rapidly within the scientific computing community, currently no generally accepted programming model exists for these machines. As a consequence, most application developers utilize pure distributed-memory programming models, usually based on the message passing interface (MPI), and thus may miss a number of optimization opportunities offered by the shared-memory available within the nodes of a cluster. In order to address these issues, we have experimented with different parallelization strategies for Cholesky decomposition comparing pure message passing strategies to a hybrid parallelization strategy that combines message passing with shared-memory parallelization based on multi-threading.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Incomplete Cholesky Parallel Preconditioners with Selective Inversion

Consider the solution of a large linear system when the coe cient matrix is sparse, symmetric, and positive de nite. One approach is the method of \conjugate gradient" (CG) with an incomplete Cholesky (IC) preconditioner (ICCG). A key problem with the design of a parallel ICCG solver is the bottleneck posed by repeated parallel sparse triangular solves to apply the preconditioner. Our work conc...

متن کامل

Parallel and fully recursive multifrontal sparse Cholesky

We describe the design, implementation, and performance of a new parallel sparse Cholesky factorization code. The code uses a multifrontal factorization strategy. Operations on small dense submatrices are performed using new dense matrix subroutines that are part of the code, although the code can also use the blas and lapack. The new code is recursive at both the sparse and the dense levels, i...

متن کامل

A Multilevel Block Incomplete Cholesky Preconditioner for Solving Rectangular Sparse Matrices from Linear Least Squares Problems

An incomplete factorization method for preconditioning symmetric positive definite matrices is introduced to solve normal equations. The normal equations are formed as a means to solve rectangular matrices from linear least squares problems. The procedure is based on a block incomplete Cholesky factorization and a multilevel recursive strategy with an approximate Schur complement matrix formed ...

متن کامل

Preconditioning of wavelet BEM by the incomplete Cholesky factorization

The present paper is dedicated to the preconditioning of boundary element matrices in wavelet coordinates. We investigate the incomplete Cholesky factorization for a pattern which includes also the coefficients of all off-diagonal bands associated with the level-level-interactions. The pattern is chosen in such a way that the incomplete Cholesky factorization is computable in log-linear complex...

متن کامل

On Positive Semidefinite Modification Schemes for Incomplete Cholesky Factorization

Incomplete Cholesky factorizations have long been important as preconditioners for use in solving largescale symmetric positive-definite linear systems. In this paper, we focus on the relationship between two important positive semidefinite modification schemes that were introduced to avoid factorization breakdown, namely the approach of Jennings and Malik and that of Tismenetsky. We present a ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005